An Intelligent Approach to Information Retrieval System Using Enhanced DIG and FP-Tree Techniques

نویسندگان

  • P. Janarthanan
  • N. Rajkumar
چکیده

Information retrieval is the process of retrieving all the relevant documents that satisfies the user query from large corpora. It is aimed to provide the relevant information and documents that matches the user query. Outcome of the several research results confirms that difficulties in information retrieval are matching the query with corpus. Consequently, the enhanced indexing technique named Document Index Graph (DIG) used for indexing document collection in order to match and retrieve information efficiently. Hence, an enhanced DIG has been constructed that stores all the stemmed sentences of documents in the graph. The words with same stem can be stored only once in DIG. This helps to reduce the size of the graph. The most frequently appearing words are planted into FP (Frequent Pattern) Tree. The FP-tree is a compact representation of all relevant frequently occurring information in a corpus. The enhanced FP tree with a table generates all types of possible term set which satisfy the minimum support. Information is retrieved with the help of FP-Tree and Document Index Graph. Keyword: Stemming, Document Index Graph, Query Processing, Frequent Pattern Tree and Information Retrieval.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Designing an intelligent system for diagnosing type 2 diabetes using the data mining approach: brief report

Background: Diabetes mellitus has several complications. The Late diagnosis of diabetes in people leads to the spread of complications. Therefore, this study has been done to determine the possibility of predicting diabetes type 2 by using data mining techniques. Methods: This is a descriptive-analytic study that was conducted as a cross-sectional study. The study population included people re...

متن کامل

Behavioral Considerations in Developing Web Information Systems: User-centered Design Agenda

The current paper explores designing a web information retrieval system regarding the searching behavior of users in real and everyday life. Designing an information system that is closely linked to human behavior is equally important for providers and the end users.  From an Information Science point of view, four approaches in designing information retrieval systems were identified as system-...

متن کامل

Experimental Evaluation of Algorithmic Effort Estimation Models using Projects Clustering

One of the most important aspects of software project management is the estimation of cost and time required for running information system. Therefore, software managers try to carry estimation based on behavior, properties, and project restrictions. Software cost estimation refers to the process of development requirement prediction of software system. Various kinds of effort estimation patter...

متن کامل

Review of ranked-based and unranked-based metrics for determining the effectiveness of search engines

Purpose: Traditionally, there have many metrics for evaluating the search engine, nevertheless various researchers’ proposed new metrics in recent years. Aware of this new metrics is essential to conduct research on evaluation of the search engine field. So, the purpose of this study was to provide an analysis of important and new metrics for evaluating the search engines. Methodology: This is ...

متن کامل

Using Data Mining Techniques for Intelligent Diagnosis of Severity of Depressive Disorder

Introduction: Implementing a method that can help individuals diagnose or prevent mental disorders can be an important step in preventing and controlling these disorders especially in the early stages. The objective of this research was to apply data mining techniques for intelligent diagnosis of severity of depressive disorder. Method: The present applied research was carried out by going to a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014